Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 772 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 328.7 KiB |
| Average record size in memory | 436.0 B |
Variable types
| NUM | 11 |
|---|---|
| CAT | 5 |
ACTUAL_WORTH is highly correlated with tree_3 | High correlation |
tree_3 is highly correlated with ACTUAL_WORTH | High correlation |
PROJECT_NAME_EN is highly correlated with STATUS_CODE and 1 other fields | High correlation |
STATUS_CODE is highly correlated with PROJECT_NAME_EN | High correlation |
MASTER_PROJECT_EN is highly correlated with AREA_NAME_EN and 1 other fields | High correlation |
AREA_NAME_EN is highly correlated with MASTER_PROJECT_EN | High correlation |
PROCEDURE_AREA is highly skewed (γ1 = 27.2987483) | Skewed |
CURRENT_STATUS_year_after_min_year has 171 (22.2%) zeros | Zeros |
INSTANCE_DATE_year_after_min_year has 55 (7.1%) zeros | Zeros |
CREATION_year_after_min_year has 32 (4.1%) zeros | Zeros |
Reproduction
| Analysis started | 2020-11-23 10:00:18.848656 |
|---|---|
| Analysis finished | 2020-11-23 10:00:35.806450 |
| Duration | 16.96 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 20 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.864313472 |
|---|---|
| Minimum | 0 |
| Maximum | 16 |
| Zeros | 171 |
| Zeros (%) | 22.2% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 6 |
| median | 7 |
| Q3 | 7 |
| 95-th percentile | 12.3375 |
| Maximum | 16 |
| Range | 16 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 3.505017589 |
|---|---|
| Coefficient of variation (CV) | 0.5976859193 |
| Kurtosis | 0.1039408593 |
| Mean | 5.864313472 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.3891867205 |
| Sum | 4527.25 |
| Variance | 12.2851483 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 7 | 453 | 58.7% | |
| 0 | 171 | 22.2% | |
| 6.75 | 36 | 4.7% | |
| 6 | 30 | 3.9% | |
| 12.75 | 29 | 3.8% | |
| 9.5 | 14 | 1.8% | |
| 11.25 | 12 | 1.6% | |
| 15 | 5 | 0.6% | |
| 6.5 | 4 | 0.5% | |
| 11 | 4 | 0.5% | |
| 16 | 3 | 0.4% | |
| 7.75 | 2 | 0.3% | |
| 10 | 2 | 0.3% | |
| 13 | 1 | 0.1% | |
| 12 | 1 | 0.1% | |
| 3.25 | 1 | 0.1% | |
| 9 | 1 | 0.1% | |
| 14 | 1 | 0.1% | |
| 5 | 1 | 0.1% | |
| 10.75 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 171 | 22.2% | |
| 3.25 | 1 | 0.1% | |
| 5 | 1 | 0.1% | |
| 6 | 30 | 3.9% | |
| 6.5 | 4 | 0.5% | |
| 6.75 | 36 | 4.7% | |
| 7 | 453 | 58.7% | |
| 7.75 | 2 | 0.3% | |
| 9 | 1 | 0.1% | |
| 9.5 | 14 | 1.8% |
| Value | Count | Frequency (%) | |
| 16 | 3 | 0.4% | |
| 15 | 5 | 0.6% | |
| 14 | 1 | 0.1% | |
| 13 | 1 | 0.1% | |
| 12.75 | 29 | 3.8% | |
| 12 | 1 | 0.1% | |
| 11.25 | 12 | 1.6% | |
| 11 | 4 | 0.5% | |
| 10.75 | 1 | 0.1% | |
| 10 | 2 | 0.3% |
SEPARATED_REFERENCE
Real number (ℝ)
| Distinct | 8 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.380588217e-16 |
|---|---|
| Minimum | -0.3201492815 |
| Maximum | 3.550337086 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | -0.3201492815 |
|---|---|
| 5-th percentile | -0.3201492815 |
| Q1 | -0.3201492815 |
| median | -0.2648176994 |
| Q3 | -0.2446337362 |
| 95-th percentile | 3.550337086 |
| Maximum | 3.550337086 |
| Range | 3.870486368 |
| Interquartile range (IQR) | 0.07551554528 |
Descriptive statistics
| Standard deviation | 1 |
|---|---|
| Coefficient of variation (CV) | 7.243289401e+15 |
| Kurtosis | 8.654853986 |
| Mean | 1.380588217e-16 |
| Median Absolute Deviation (MAD) | 0.05533158209 |
| Skewness | 3.252682151 |
| Sum | 1.065814104e-13 |
| Variance | 1 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| -0.3201492815 | 358 | 46.4% | |
| -0.2446337362 | 315 | 40.8% | |
| 3.550337086 | 56 | 7.3% | |
| -0.2648176994 | 31 | 4.0% | |
| -0.2604281198 | 7 | 0.9% | |
| -0.2606436324 | 2 | 0.3% | |
| 1.83420995 | 2 | 0.3% | |
| -0.2605942654 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| -0.3201492815 | 358 | 46.4% | |
| -0.2648176994 | 31 | 4.0% | |
| -0.2606436324 | 2 | 0.3% | |
| -0.2605942654 | 1 | 0.1% | |
| -0.2604281198 | 7 | 0.9% | |
| -0.2446337362 | 315 | 40.8% | |
| 1.83420995 | 2 | 0.3% | |
| 3.550337086 | 56 | 7.3% |
| Value | Count | Frequency (%) | |
| 3.550337086 | 56 | 7.3% | |
| 1.83420995 | 2 | 0.3% | |
| -0.2446337362 | 315 | 40.8% | |
| -0.2604281198 | 7 | 0.9% | |
| -0.2605942654 | 1 | 0.1% | |
| -0.2606436324 | 2 | 0.3% | |
| -0.2648176994 | 31 | 4.0% | |
| -0.3201492815 | 358 | 46.4% |
INSTANCE_DATE_day_normalized
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5648086244 |
|---|---|
| Minimum | 0.03225806452 |
| Maximum | 1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 0.03225806452 |
|---|---|
| 5-th percentile | 0.09677419355 |
| Q1 | 0.2580645161 |
| median | 0.6129032258 |
| Q3 | 0.8387096774 |
| 95-th percentile | 0.935483871 |
| Maximum | 1 |
| Range | 0.9677419355 |
| Interquartile range (IQR) | 0.5806451613 |
Descriptive statistics
| Standard deviation | 0.2991556079 |
|---|---|
| Coefficient of variation (CV) | 0.5296583568 |
| Kurtosis | -1.437492778 |
| Mean | 0.5648086244 |
| Median Absolute Deviation (MAD) | 0.2580645161 |
| Skewness | -0.2048474114 |
| Sum | 436.0322581 |
| Variance | 0.08949407776 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.2258064516 | 79 | 10.2% | |
| 0.8709677419 | 71 | 9.2% | |
| 0.8064516129 | 67 | 8.7% | |
| 0.9032258065 | 48 | 6.2% | |
| 0.935483871 | 40 | 5.2% | |
| 0.1612903226 | 34 | 4.4% | |
| 0.7741935484 | 33 | 4.3% | |
| 0.4193548387 | 29 | 3.8% | |
| 0.6451612903 | 29 | 3.8% | |
| 0.3548387097 | 26 | 3.4% | |
| 1 | 26 | 3.4% | |
| 0.03225806452 | 24 | 3.1% | |
| 0.8387096774 | 24 | 3.1% | |
| 0.1935483871 | 21 | 2.7% | |
| 0.4516129032 | 18 | 2.3% | |
| 0.2903225806 | 18 | 2.3% | |
| 0.2580645161 | 17 | 2.2% | |
| 0.3870967742 | 17 | 2.2% | |
| 0.7419354839 | 15 | 1.9% | |
| 0.5483870968 | 15 | 1.9% | |
| 0.4838709677 | 15 | 1.9% | |
| 0.5806451613 | 14 | 1.8% | |
| 0.3225806452 | 13 | 1.7% | |
| 0.6129032258 | 13 | 1.7% | |
| 0.7096774194 | 13 | 1.7% | |
| Other values (6) | 53 | 6.9% |
| Value | Count | Frequency (%) | |
| 0.03225806452 | 24 | 3.1% | |
| 0.06451612903 | 9 | 1.2% | |
| 0.09677419355 | 10 | 1.3% | |
| 0.1290322581 | 11 | 1.4% | |
| 0.1612903226 | 34 | 4.4% | |
| 0.1935483871 | 21 | 2.7% | |
| 0.2258064516 | 79 | 10.2% | |
| 0.2580645161 | 17 | 2.2% | |
| 0.2903225806 | 18 | 2.3% | |
| 0.3225806452 | 13 | 1.7% |
| Value | Count | Frequency (%) | |
| 1 | 26 | 3.4% | |
| 0.9677419355 | 6 | 0.8% | |
| 0.935483871 | 40 | 5.2% | |
| 0.9032258065 | 48 | 6.2% | |
| 0.8709677419 | 71 | 9.2% | |
| 0.8387096774 | 24 | 3.1% | |
| 0.8064516129 | 67 | 8.7% | |
| 0.7741935484 | 33 | 4.3% | |
| 0.7419354839 | 15 | 1.9% | |
| 0.7096774194 | 13 | 1.7% |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.351036269 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 55 |
| Zeros (%) | 7.1% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.475203242 |
|---|---|
| Coefficient of variation (CV) | 0.627469368 |
| Kurtosis | -1.331067433 |
| Mean | 2.351036269 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.1071890667 |
| Sum | 1815 |
| Variance | 2.176224606 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 275 | 35.6% | |
| 4 | 182 | 23.6% | |
| 3 | 163 | 21.1% | |
| 0 | 55 | 7.1% | |
| 2 | 54 | 7.0% | |
| 5 | 43 | 5.6% |
| Value | Count | Frequency (%) | |
| 0 | 55 | 7.1% | |
| 1 | 275 | 35.6% | |
| 2 | 54 | 7.0% | |
| 3 | 163 | 21.1% | |
| 4 | 182 | 23.6% | |
| 5 | 43 | 5.6% |
| Value | Count | Frequency (%) | |
| 5 | 43 | 5.6% | |
| 4 | 182 | 23.6% | |
| 3 | 163 | 21.1% | |
| 2 | 54 | 7.0% | |
| 1 | 275 | 35.6% | |
| 0 | 55 | 7.1% |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.113989637 |
|---|---|
| Minimum | 4 |
| Maximum | 11 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 5 |
| median | 5 |
| Q3 | 5 |
| 95-th percentile | 9 |
| Maximum | 11 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.338819249 |
|---|---|
| Coefficient of variation (CV) | 0.2617954559 |
| Kurtosis | 8.309939337 |
| Mean | 5.113989637 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.895972495 |
| Sum | 3948 |
| Variance | 1.79243698 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 5 | 534 | 69.2% | |
| 4 | 173 | 22.4% | |
| 10 | 26 | 3.4% | |
| 9 | 19 | 2.5% | |
| 6 | 13 | 1.7% | |
| 11 | 7 | 0.9% |
| Value | Count | Frequency (%) | |
| 4 | 173 | 22.4% | |
| 5 | 534 | 69.2% | |
| 6 | 13 | 1.7% | |
| 9 | 19 | 2.5% | |
| 10 | 26 | 3.4% | |
| 11 | 7 | 0.9% |
| Value | Count | Frequency (%) | |
| 11 | 7 | 0.9% | |
| 10 | 26 | 3.4% | |
| 9 | 19 | 2.5% | |
| 6 | 13 | 1.7% | |
| 5 | 534 | 69.2% | |
| 4 | 173 | 22.4% |
| Distinct | 477 |
|---|---|
| Distinct (%) | 61.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.973823411e-17 |
|---|---|
| Minimum | -0.101700452 |
| Maximum | 27.58029489 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | -0.101700452 |
|---|---|
| 5-th percentile | -0.101700452 |
| Q1 | -0.0647660388 |
| median | -0.0519032448 |
| Q3 | -0.03522417304 |
| 95-th percentile | 0.1234748431 |
| Maximum | 27.58029489 |
| Range | 27.68199534 |
| Interquartile range (IQR) | 0.02954186576 |
Descriptive statistics
| Standard deviation | 1 |
|---|---|
| Coefficient of variation (CV) | 1.114352215e+16 |
| Kurtosis | 753.3754654 |
| Mean | 8.973823411e-17 |
| Median Absolute Deviation (MAD) | 0.0134300181 |
| Skewness | 27.2987483 |
| Sum | 6.927791674e-14 |
| Variance | 1 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| -0.101700452 | 113 | 14.6% | |
| -0.09306016063 | 36 | 4.7% | |
| -0.09717458509 | 20 | 2.6% | |
| -0.0647660388 | 20 | 2.6% | |
| -0.06252993856 | 18 | 2.3% | |
| -0.05805773806 | 12 | 1.6% | |
| -0.05582163781 | 7 | 0.9% | |
| -0.05403275761 | 7 | 0.9% | |
| -0.04598279671 | 5 | 0.6% | |
| -0.04329947641 | 4 | 0.5% | |
| 0.2453721214 | 4 | 0.5% | |
| -0.05656700456 | 3 | 0.4% | |
| -0.05358553756 | 3 | 0.4% | |
| -0.04410596323 | 3 | 0.4% | |
| -0.05176833342 | 3 | 0.4% | |
| -0.03100688797 | 2 | 0.3% | |
| 0.2044350888 | 2 | 0.3% | |
| -0.06141188843 | 2 | 0.3% | |
| -0.03203698482 | 2 | 0.3% | |
| -0.02328041624 | 2 | 0.3% | |
| -0.0539790912 | 2 | 0.3% | |
| 0.01978538383 | 2 | 0.3% | |
| 0.1310000658 | 2 | 0.3% | |
| -0.05913106618 | 2 | 0.3% | |
| -0.05750318519 | 2 | 0.3% | |
| Other values (452) | 494 | 64.0% |
| Value | Count | Frequency (%) | |
| -0.101700452 | 113 | 14.6% | |
| -0.09722527003 | 1 | 0.1% | |
| -0.09717458509 | 20 | 2.6% | |
| -0.09306016063 | 36 | 4.7% | |
| -0.08416644457 | 1 | 0.1% | |
| -0.0665504468 | 1 | 0.1% | |
| -0.06542345228 | 1 | 0.1% | |
| -0.06510294458 | 2 | 0.3% | |
| -0.06507760211 | 1 | 0.1% | |
| -0.06505822257 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 27.58029489 | 1 | 0.1% | |
| 2.355325011 | 1 | 0.1% | |
| 0.3850583225 | 1 | 0.1% | |
| 0.2845948105 | 1 | 0.1% | |
| 0.2727747846 | 1 | 0.1% | |
| 0.2453721214 | 4 | 0.5% | |
| 0.2415796954 | 1 | 0.1% | |
| 0.2369971806 | 1 | 0.1% | |
| 0.2311907736 | 1 | 0.1% | |
| 0.2199923836 | 1 | 0.1% |
| Distinct | 8 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.784974093 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 32 |
| Zeros (%) | 4.1% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 4 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.21293595 |
|---|---|
| Coefficient of variation (CV) | 0.4355286294 |
| Kurtosis | 0.5749608199 |
| Mean | 2.784974093 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.09371983771 |
| Sum | 2150 |
| Variance | 1.471213618 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 3 | 316 | 40.9% | |
| 4 | 173 | 22.4% | |
| 2 | 139 | 18.0% | |
| 1 | 88 | 11.4% | |
| 0 | 32 | 4.1% | |
| 6 | 22 | 2.8% | |
| 7 | 1 | 0.1% | |
| 5 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 32 | 4.1% | |
| 1 | 88 | 11.4% | |
| 2 | 139 | 18.0% | |
| 3 | 316 | 40.9% | |
| 4 | 173 | 22.4% | |
| 5 | 1 | 0.1% | |
| 6 | 22 | 2.8% | |
| 7 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 7 | 1 | 0.1% | |
| 6 | 22 | 2.8% | |
| 5 | 1 | 0.1% | |
| 4 | 173 | 22.4% | |
| 3 | 316 | 40.9% | |
| 2 | 139 | 18.0% | |
| 1 | 88 | 11.4% | |
| 0 | 32 | 4.1% |
PROCEDURE_NUMBER
Real number (ℝ)
| Distinct | 740 |
|---|---|
| Distinct (%) | 95.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.363137158e-17 |
|---|---|
| Minimum | -1.544780129 |
| Maximum | 6.054670009 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | -1.544780129 |
|---|---|
| 5-th percentile | -1.321203592 |
| Q1 | -0.6531152261 |
| median | -0.05516660067 |
| Q3 | 0.3007901373 |
| 95-th percentile | 1.538849492 |
| Maximum | 6.054670009 |
| Range | 7.599450138 |
| Interquartile range (IQR) | 0.9539053635 |
Descriptive statistics
| Standard deviation | 1 |
|---|---|
| Coefficient of variation (CV) | 1.358116763e+16 |
| Kurtosis | 7.729789287 |
| Mean | 7.363137158e-17 |
| Median Absolute Deviation (MAD) | 0.3942303412 |
| Skewness | 1.915247877 |
| Sum | 5.684341886e-14 |
| Variance | 1 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.04388010416 | 3 | 0.4% | |
| -1.100170477 | 3 | 0.4% | |
| 0.3349062245 | 2 | 0.3% | |
| -0.1282899704 | 2 | 0.3% | |
| -0.09747544002 | 2 | 0.3% | |
| 0.2512667849 | 2 | 0.3% | |
| -0.7612106423 | 2 | 0.3% | |
| 0.2507776654 | 2 | 0.3% | |
| -1.212667969 | 2 | 0.3% | |
| -0.04367229172 | 2 | 0.3% | |
| 0.2630056536 | 2 | 0.3% | |
| -0.234918028 | 2 | 0.3% | |
| 0.253223263 | 2 | 0.3% | |
| -0.1405179587 | 2 | 0.3% | |
| -0.1292682095 | 2 | 0.3% | |
| -0.6311048473 | 2 | 0.3% | |
| -1.17940784 | 2 | 0.3% | |
| 0.05072777758 | 2 | 0.3% | |
| -0.6315939668 | 2 | 0.3% | |
| -0.1498112297 | 2 | 0.3% | |
| -0.186984314 | 2 | 0.3% | |
| 0.3339279855 | 2 | 0.3% | |
| 0.04974953852 | 2 | 0.3% | |
| -0.1278008509 | 2 | 0.3% | |
| -0.06763914869 | 2 | 0.3% | |
| Other values (715) | 720 | 93.3% |
| Value | Count | Frequency (%) | |
| -1.544780129 | 1 | 0.1% | |
| -1.542823651 | 1 | 0.1% | |
| -1.537443336 | 1 | 0.1% | |
| -1.53353038 | 1 | 0.1% | |
| -1.510052643 | 1 | 0.1% | |
| -1.509074404 | 1 | 0.1% | |
| -1.508585284 | 1 | 0.1% | |
| -1.503694089 | 1 | 0.1% | |
| -1.478748993 | 1 | 0.1% | |
| -1.467010124 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 6.054670009 | 1 | 0.1% | |
| 5.997443024 | 1 | 0.1% | |
| 5.986682394 | 1 | 0.1% | |
| 5.451585628 | 1 | 0.1% | |
| 4.447912353 | 1 | 0.1% | |
| 4.366229391 | 1 | 0.1% | |
| 4.10748516 | 1 | 0.1% | |
| 3.995476787 | 1 | 0.1% | |
| 3.974444648 | 1 | 0.1% | |
| 3.944608356 | 1 | 0.1% |
MUNC_NUMBER
Real number (ℝ)
| Distinct | 459 |
|---|---|
| Distinct (%) | 59.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1869287122 |
|---|---|
| Minimum | -2.044785387 |
| Maximum | 2.485572941 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | -2.044785387 |
|---|---|
| 5-th percentile | -1.833656426 |
| Q1 | -0.1688459216 |
| median | 0.6313914877 |
| Q3 | 0.7140836641 |
| 95-th percentile | 0.8119771257 |
| Maximum | 2.485572941 |
| Range | 4.530358328 |
| Interquartile range (IQR) | 0.8829295857 |
Descriptive statistics
| Standard deviation | 0.8168755578 |
|---|---|
| Coefficient of variation (CV) | 4.369984408 |
| Kurtosis | 1.062818206 |
| Mean | 0.1869287122 |
| Median Absolute Deviation (MAD) | 0.1606925981 |
| Skewness | -1.388334826 |
| Sum | 144.3089658 |
| Variance | 0.6672856769 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.6313914877 | 174 | 22.5% | |
| 0.3304154243 | 13 | 1.7% | |
| -0.3569266377 | 10 | 1.3% | |
| 0.7641681454 | 6 | 0.8% | |
| 0.257341345 | 6 | 0.8% | |
| 0.397859398 | 6 | 0.8% | |
| 0.815777447 | 4 | 0.5% | |
| 0.3568065445 | 4 | 0.5% | |
| 0.6969587595 | 4 | 0.5% | |
| -1.833656426 | 4 | 0.5% | |
| 0.4540431604 | 3 | 0.4% | |
| -0.19412275 | 3 | 0.4% | |
| 0.376629208 | 3 | 0.4% | |
| 0.4531048095 | 3 | 0.4% | |
| 0.2842016406 | 3 | 0.4% | |
| 0.7761321199 | 3 | 0.4% | |
| 0.7782434095 | 3 | 0.4% | |
| -1.698064716 | 3 | 0.4% | |
| 0.5746212559 | 3 | 0.4% | |
| -0.03765273107 | 3 | 0.4% | |
| 0.7106821419 | 2 | 0.3% | |
| 0.1880206695 | 2 | 0.3% | |
| -0.7780116211 | 2 | 0.3% | |
| 0.3923465862 | 2 | 0.3% | |
| 0.6525043838 | 2 | 0.3% | |
| Other values (434) | 501 | 64.9% |
| Value | Count | Frequency (%) | |
| -2.044785387 | 1 | 0.1% | |
| -2.044316212 | 1 | 0.1% | |
| -2.043847036 | 1 | 0.1% | |
| -2.043612449 | 1 | 0.1% | |
| -2.043377861 | 1 | 0.1% | |
| -2.042908685 | 1 | 0.1% | |
| -2.041970335 | 2 | 0.3% | |
| -1.91716966 | 1 | 0.1% | |
| -1.915292958 | 1 | 0.1% | |
| -1.914823782 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 2.485572941 | 1 | 0.1% | |
| 2.467275098 | 1 | 0.1% | |
| 1.12918666 | 1 | 0.1% | |
| 1.097282728 | 1 | 0.1% | |
| 1.095406027 | 1 | 0.1% | |
| 0.8270376582 | 1 | 0.1% | |
| 0.8265684828 | 2 | 0.3% | |
| 0.8251609564 | 1 | 0.1% | |
| 0.8246917809 | 1 | 0.1% | |
| 0.8242226054 | 1 | 0.1% |
MUNC_ZIP_CODE
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -1.602632822e-15 |
|---|---|
| Minimum | -2.092823108 |
| Maximum | 1.464762274 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | -2.092823108 |
|---|---|
| 5-th percentile | -2.092823108 |
| Q1 | -0.1865599311 |
| median | -0.1865599311 |
| Q3 | 0.01043991086 |
| 95-th percentile | 1.464762274 |
| Maximum | 1.464762274 |
| Range | 3.557585381 |
| Interquartile range (IQR) | 0.196999842 |
Descriptive statistics
| Standard deviation | 1 |
|---|---|
| Coefficient of variation (CV) | -6.239732434e+14 |
| Kurtosis | 0.1108089394 |
| Mean | -1.602632822e-15 |
| Median Absolute Deviation (MAD) | 0.196999842 |
| Skewness | -0.3258195182 |
| Sum | -1.237232539e-12 |
| Variance | 1 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| -0.1865599311 | 315 | 40.8% | |
| 0.01043991086 | 186 | 24.1% | |
| 1.464762274 | 171 | 22.2% | |
| -2.092823108 | 56 | 7.3% | |
| -1.918999718 | 31 | 4.0% | |
| -1.832088023 | 9 | 1.2% | |
| -0.823912361 | 2 | 0.3% | |
| -0.2618834001 | 1 | 0.1% | |
| 1.435791709 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| -2.092823108 | 56 | 7.3% | |
| -1.918999718 | 31 | 4.0% | |
| -1.832088023 | 9 | 1.2% | |
| -0.823912361 | 2 | 0.3% | |
| -0.2618834001 | 1 | 0.1% | |
| -0.1865599311 | 315 | 40.8% | |
| 0.01043991086 | 186 | 24.1% | |
| 1.435791709 | 1 | 0.1% | |
| 1.464762274 | 171 | 22.2% |
| Value | Count | Frequency (%) | |
| 1.464762274 | 171 | 22.2% | |
| 1.435791709 | 1 | 0.1% | |
| 0.01043991086 | 186 | 24.1% | |
| -0.1865599311 | 315 | 40.8% | |
| -0.2618834001 | 1 | 0.1% | |
| -0.823912361 | 2 | 0.3% | |
| -1.832088023 | 9 | 1.2% | |
| -1.918999718 | 31 | 4.0% | |
| -2.092823108 | 56 | 7.3% |
| Distinct | 501 |
|---|---|
| Distinct (%) | 64.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5868378.021 |
|---|---|
| Minimum | 1235000 |
| Maximum | 105027384 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 6.2 KiB |
Quantile statistics
| Minimum | 1235000 |
|---|---|
| 5-th percentile | 1300000 |
| Q1 | 2981250 |
| median | 3868388 |
| Q3 | 4770888 |
| 95-th percentile | 28122560 |
| Maximum | 105027384 |
| Range | 103792384 |
| Interquartile range (IQR) | 1789638 |
Descriptive statistics
| Standard deviation | 9268864.308 |
|---|---|
| Coefficient of variation (CV) | 1.579459312 |
| Kurtosis | 28.95516968 |
| Mean | 5868378.021 |
| Median Absolute Deviation (MAD) | 903500 |
| Skewness | 4.691892838 |
| Sum | 4530387832 |
| Variance | 8.591184557e+13 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1300000 | 47 | 6.1% | |
| 1274000 | 26 | 3.4% | |
| 1500000 | 19 | 2.5% | |
| 1400000 | 13 | 1.7% | |
| 1372000 | 12 | 1.6% | |
| 1470000 | 10 | 1.3% | |
| 3803888 | 10 | 1.3% | |
| 3476888 | 8 | 1.0% | |
| 1450000 | 6 | 0.8% | |
| 1350000 | 5 | 0.6% | |
| 1235000 | 5 | 0.6% | |
| 4718888 | 4 | 0.5% | |
| 4000000 | 4 | 0.5% | |
| 1568000 | 4 | 0.5% | |
| 4071888 | 4 | 0.5% | |
| 1261000 | 4 | 0.5% | |
| 2800000 | 4 | 0.5% | |
| 41100000 | 4 | 0.5% | |
| 4777888 | 3 | 0.4% | |
| 3505888 | 3 | 0.4% | |
| 3700000 | 3 | 0.4% | |
| 3486888 | 3 | 0.4% | |
| 3524888 | 3 | 0.4% | |
| 2550000 | 3 | 0.4% | |
| 18000000 | 3 | 0.4% | |
| Other values (476) | 562 | 72.8% |
| Value | Count | Frequency (%) | |
| 1235000 | 5 | 0.6% | |
| 1261000 | 4 | 0.5% | |
| 1266703 | 1 | 0.1% | |
| 1274000 | 26 | 3.4% | |
| 1287000 | 1 | 0.1% | |
| 1300000 | 47 | 6.1% | |
| 1330000 | 1 | 0.1% | |
| 1336500 | 1 | 0.1% | |
| 1350000 | 5 | 0.6% | |
| 1358000 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 105027384 | 1 | 0.1% | |
| 72710000 | 1 | 0.1% | |
| 72000000 | 1 | 0.1% | |
| 52539955 | 1 | 0.1% | |
| 51100000 | 1 | 0.1% | |
| 49382900 | 1 | 0.1% | |
| 42800000 | 1 | 0.1% | |
| 41100000 | 4 | 0.5% | |
| 40017620 | 1 | 0.1% | |
| 39708860 | 1 | 0.1% |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 KiB |
| Hadaeq Sheikh Mohammed Bin Rashid | |
|---|---|
| Wadi Al Safa 7 | |
| Al Yufrah 2 | |
| Island 2 | |
| Jumeirah First | 31 |
| Value | Count | Frequency (%) | |
| Hadaeq Sheikh Mohammed Bin Rashid | 315 | 40.8% | |
| Wadi Al Safa 7 | 186 | 24.1% | |
| Al Yufrah 2 | 171 | 22.2% | |
| Island 2 | 56 | 7.3% | |
| Jumeirah First | 31 | 4.0% | |
| Rare cases | 13 | 1.7% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 33 |
|---|---|
| Median length | 14 |
| Mean length | 20.58549223 |
| Min length | 8 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 2260 | 14.2% | ||
| a | 2102 | 13.2% | |
| h | 1462 | 9.2% | |
| i | 1193 | 7.5% | |
| d | 1187 | 7.5% | |
| e | 1002 | 6.3% | |
| m | 661 | 4.2% | |
| S | 501 | 3.2% | |
| s | 428 | 2.7% | |
| l | 413 | 2.6% | |
| n | 371 | 2.3% | |
| A | 357 | 2.2% | |
| f | 357 | 2.2% | |
| R | 328 | 2.1% | |
| H | 315 | 2.0% | |
| q | 315 | 2.0% | |
| k | 315 | 2.0% | |
| M | 315 | 2.0% | |
| o | 315 | 2.0% | |
| B | 315 | 2.0% | |
| r | 246 | 1.5% | |
| 2 | 227 | 1.4% | |
| u | 202 | 1.3% | |
| W | 186 | 1.2% | |
| 7 | 186 | 1.2% | |
| Other values (6) | 333 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 10613 | 66.8% | |
| Uppercase Letter | 2606 | 16.4% | |
| Space Separator | 2260 | 14.2% | |
| Decimal Number | 413 | 2.6% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| S | 501 | 19.2% | |
| A | 357 | 13.7% | |
| R | 328 | 12.6% | |
| H | 315 | 12.1% | |
| M | 315 | 12.1% | |
| B | 315 | 12.1% | |
| W | 186 | 7.1% | |
| Y | 171 | 6.6% | |
| I | 56 | 2.1% | |
| J | 31 | 1.2% | |
| F | 31 | 1.2% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 2102 | 19.8% | |
| h | 1462 | 13.8% | |
| i | 1193 | 11.2% | |
| d | 1187 | 11.2% | |
| e | 1002 | 9.4% | |
| m | 661 | 6.2% | |
| s | 428 | 4.0% | |
| l | 413 | 3.9% | |
| n | 371 | 3.5% | |
| f | 357 | 3.4% | |
| q | 315 | 3.0% | |
| k | 315 | 3.0% | |
| o | 315 | 3.0% | |
| r | 246 | 2.3% | |
| u | 202 | 1.9% | |
| t | 31 | 0.3% | |
| c | 13 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 2260 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 227 | 55.0% | |
| 7 | 186 | 45.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 13219 | 83.2% | |
| Common | 2673 | 16.8% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 2102 | 15.9% | |
| h | 1462 | 11.1% | |
| i | 1193 | 9.0% | |
| d | 1187 | 9.0% | |
| e | 1002 | 7.6% | |
| m | 661 | 5.0% | |
| S | 501 | 3.8% | |
| s | 428 | 3.2% | |
| l | 413 | 3.1% | |
| n | 371 | 2.8% | |
| A | 357 | 2.7% | |
| f | 357 | 2.7% | |
| R | 328 | 2.5% | |
| H | 315 | 2.4% | |
| q | 315 | 2.4% | |
| k | 315 | 2.4% | |
| M | 315 | 2.4% | |
| o | 315 | 2.4% | |
| B | 315 | 2.4% | |
| r | 246 | 1.9% | |
| u | 202 | 1.5% | |
| W | 186 | 1.4% | |
| Y | 171 | 1.3% | |
| I | 56 | 0.4% | |
| J | 31 | 0.2% | |
| Other values (3) | 75 | 0.6% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 2260 | 84.5% | ||
| 2 | 227 | 8.5% | |
| 7 | 186 | 7.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 15892 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 2260 | 14.2% | ||
| a | 2102 | 13.2% | |
| h | 1462 | 9.2% | |
| i | 1193 | 7.5% | |
| d | 1187 | 7.5% | |
| e | 1002 | 6.3% | |
| m | 661 | 4.2% | |
| S | 501 | 3.2% | |
| s | 428 | 2.7% | |
| l | 413 | 2.6% | |
| n | 371 | 2.3% | |
| A | 357 | 2.2% | |
| f | 357 | 2.2% | |
| R | 328 | 2.1% | |
| H | 315 | 2.0% | |
| q | 315 | 2.0% | |
| k | 315 | 2.0% | |
| M | 315 | 2.0% | |
| o | 315 | 2.0% | |
| B | 315 | 2.0% | |
| r | 246 | 1.5% | |
| 2 | 227 | 1.4% | |
| u | 202 | 1.3% | |
| W | 186 | 1.2% | |
| 7 | 186 | 1.2% | |
| Other values (6) | 333 | 2.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 KiB |
| VACANT | |
|---|---|
| PREMISED |
| Value | Count | Frequency (%) | |
| VACANT | 672 | 87.0% | |
| PREMISED | 100 | 13.0% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 6.259067358 |
| Min length | 6 |
Most occurring characters
| Value | Count | Frequency (%) | |
| A | 1344 | 27.8% | |
| V | 672 | 13.9% | |
| C | 672 | 13.9% | |
| N | 672 | 13.9% | |
| T | 672 | 13.9% | |
| E | 200 | 4.1% | |
| P | 100 | 2.1% | |
| R | 100 | 2.1% | |
| M | 100 | 2.1% | |
| I | 100 | 2.1% | |
| S | 100 | 2.1% | |
| D | 100 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 4832 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| A | 1344 | 27.8% | |
| V | 672 | 13.9% | |
| C | 672 | 13.9% | |
| N | 672 | 13.9% | |
| T | 672 | 13.9% | |
| E | 200 | 4.1% | |
| P | 100 | 2.1% | |
| R | 100 | 2.1% | |
| M | 100 | 2.1% | |
| I | 100 | 2.1% | |
| S | 100 | 2.1% | |
| D | 100 | 2.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 4832 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| A | 1344 | 27.8% | |
| V | 672 | 13.9% | |
| C | 672 | 13.9% | |
| N | 672 | 13.9% | |
| T | 672 | 13.9% | |
| E | 200 | 4.1% | |
| P | 100 | 2.1% | |
| R | 100 | 2.1% | |
| M | 100 | 2.1% | |
| I | 100 | 2.1% | |
| S | 100 | 2.1% | |
| D | 100 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 4832 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| A | 1344 | 27.8% | |
| V | 672 | 13.9% | |
| C | 672 | 13.9% | |
| N | 672 | 13.9% | |
| T | 672 | 13.9% | |
| E | 200 | 4.1% | |
| P | 100 | 2.1% | |
| R | 100 | 2.1% | |
| M | 100 | 2.1% | |
| I | 100 | 2.1% | |
| S | 100 | 2.1% | |
| D | 100 | 2.1% |
PRE_REGISTRATION_NUMBER
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 KiB |
| Rare cases | |
|---|---|
| Unknown | 44 |
| Value | Count | Frequency (%) | |
| Rare cases | 728 | 94.3% | |
| Unknown | 44 | 5.7% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.829015544 |
| Min length | 7 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 1456 | 19.2% | |
| e | 1456 | 19.2% | |
| s | 1456 | 19.2% | |
| R | 728 | 9.6% | |
| r | 728 | 9.6% | |
| 728 | 9.6% | ||
| c | 728 | 9.6% | |
| n | 132 | 1.7% | |
| U | 44 | 0.6% | |
| k | 44 | 0.6% | |
| o | 44 | 0.6% | |
| w | 44 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 6088 | 80.2% | |
| Uppercase Letter | 772 | 10.2% | |
| Space Separator | 728 | 9.6% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| R | 728 | 94.3% | |
| U | 44 | 5.7% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 1456 | 23.9% | |
| e | 1456 | 23.9% | |
| s | 1456 | 23.9% | |
| r | 728 | 12.0% | |
| c | 728 | 12.0% | |
| n | 132 | 2.2% | |
| k | 44 | 0.7% | |
| o | 44 | 0.7% | |
| w | 44 | 0.7% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 728 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 6860 | 90.4% | |
| Common | 728 | 9.6% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 1456 | 21.2% | |
| e | 1456 | 21.2% | |
| s | 1456 | 21.2% | |
| R | 728 | 10.6% | |
| r | 728 | 10.6% | |
| c | 728 | 10.6% | |
| n | 132 | 1.9% | |
| U | 44 | 0.6% | |
| k | 44 | 0.6% | |
| o | 44 | 0.6% | |
| w | 44 | 0.6% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 728 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 7588 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 1456 | 19.2% | |
| e | 1456 | 19.2% | |
| s | 1456 | 19.2% | |
| R | 728 | 9.6% | |
| r | 728 | 9.6% | |
| 728 | 9.6% | ||
| c | 728 | 9.6% | |
| n | 132 | 1.7% | |
| U | 44 | 0.6% | |
| k | 44 | 0.6% | |
| o | 44 | 0.6% | |
| w | 44 | 0.6% |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 KiB |
| SIDRA | |
|---|---|
| AQUILEGIA @ AKOYA OXYGEN | |
| ARABIAN RANCHES- AZALEA COMMUNITY | |
| ARABIAN RANCHES - SAMARA COMMUNITY | |
| Unknown |
| Value | Count | Frequency (%) | |
| SIDRA | 319 | 41.3% | |
| AQUILEGIA @ AKOYA OXYGEN | 171 | 22.2% | |
| ARABIAN RANCHES- AZALEA COMMUNITY | 105 | 13.6% | |
| ARABIAN RANCHES - SAMARA COMMUNITY | 81 | 10.5% | |
| Unknown | 78 | 10.1% | |
| BV Mansions | 18 | 2.3% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 34 |
|---|---|
| Median length | 7 |
| Mean length | 16.42487047 |
| Min length | 5 |
Most occurring characters
| Value | Count | Frequency (%) | |
| A | 2305 | 18.2% | |
| 1188 | 9.4% | ||
| I | 1033 | 8.1% | |
| R | 772 | 6.1% | |
| N | 729 | 5.7% | |
| E | 633 | 5.0% | |
| S | 586 | 4.6% | |
| O | 528 | 4.2% | |
| Y | 528 | 4.2% | |
| M | 471 | 3.7% | |
| U | 435 | 3.4% | |
| C | 372 | 2.9% | |
| G | 342 | 2.7% | |
| D | 319 | 2.5% | |
| L | 276 | 2.2% | |
| n | 270 | 2.1% | |
| B | 204 | 1.6% | |
| H | 186 | 1.5% | |
| - | 186 | 1.5% | |
| T | 186 | 1.5% | |
| Q | 171 | 1.3% | |
| @ | 171 | 1.3% | |
| K | 171 | 1.3% | |
| X | 171 | 1.3% | |
| Z | 105 | 0.8% | |
| Other values (7) | 342 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 10541 | 83.1% | |
| Space Separator | 1188 | 9.4% | |
| Lowercase Letter | 594 | 4.7% | |
| Dash Punctuation | 186 | 1.5% | |
| Other Punctuation | 171 | 1.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| A | 2305 | 21.9% | |
| I | 1033 | 9.8% | |
| R | 772 | 7.3% | |
| N | 729 | 6.9% | |
| E | 633 | 6.0% | |
| S | 586 | 5.6% | |
| O | 528 | 5.0% | |
| Y | 528 | 5.0% | |
| M | 471 | 4.5% | |
| U | 435 | 4.1% | |
| C | 372 | 3.5% | |
| G | 342 | 3.2% | |
| D | 319 | 3.0% | |
| L | 276 | 2.6% | |
| B | 204 | 1.9% | |
| H | 186 | 1.8% | |
| T | 186 | 1.8% | |
| Q | 171 | 1.6% | |
| K | 171 | 1.6% | |
| X | 171 | 1.6% | |
| Z | 105 | 1.0% | |
| V | 18 | 0.2% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 1188 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| @ | 171 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 270 | 45.5% | |
| o | 96 | 16.2% | |
| k | 78 | 13.1% | |
| w | 78 | 13.1% | |
| s | 36 | 6.1% | |
| a | 18 | 3.0% | |
| i | 18 | 3.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 186 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 11135 | 87.8% | |
| Common | 1545 | 12.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| A | 2305 | 20.7% | |
| I | 1033 | 9.3% | |
| R | 772 | 6.9% | |
| N | 729 | 6.5% | |
| E | 633 | 5.7% | |
| S | 586 | 5.3% | |
| O | 528 | 4.7% | |
| Y | 528 | 4.7% | |
| M | 471 | 4.2% | |
| U | 435 | 3.9% | |
| C | 372 | 3.3% | |
| G | 342 | 3.1% | |
| D | 319 | 2.9% | |
| L | 276 | 2.5% | |
| n | 270 | 2.4% | |
| B | 204 | 1.8% | |
| H | 186 | 1.7% | |
| T | 186 | 1.7% | |
| Q | 171 | 1.5% | |
| K | 171 | 1.5% | |
| X | 171 | 1.5% | |
| Z | 105 | 0.9% | |
| o | 96 | 0.9% | |
| k | 78 | 0.7% | |
| w | 78 | 0.7% | |
| Other values (4) | 90 | 0.8% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1188 | 76.9% | ||
| - | 186 | 12.0% | |
| @ | 171 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 12680 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| A | 2305 | 18.2% | |
| 1188 | 9.4% | ||
| I | 1033 | 8.1% | |
| R | 772 | 6.1% | |
| N | 729 | 5.7% | |
| E | 633 | 5.0% | |
| S | 586 | 4.6% | |
| O | 528 | 4.2% | |
| Y | 528 | 4.2% | |
| M | 471 | 3.7% | |
| U | 435 | 3.4% | |
| C | 372 | 2.9% | |
| G | 342 | 2.7% | |
| D | 319 | 2.5% | |
| L | 276 | 2.2% | |
| n | 270 | 2.1% | |
| B | 204 | 1.6% | |
| H | 186 | 1.5% | |
| - | 186 | 1.5% | |
| T | 186 | 1.5% | |
| Q | 171 | 1.3% | |
| @ | 171 | 1.3% | |
| K | 171 | 1.3% | |
| X | 171 | 1.3% | |
| Z | 105 | 0.8% | |
| Other values (7) | 342 | 2.7% |
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.2 KiB |
| Unknown | |
|---|---|
| 558 Villa | |
| Dubai Tiger Woods |
| Value | Count | Frequency (%) | |
| Unknown | 415 | 53.8% | |
| 558 Villa | 186 | 24.1% | |
| Dubai Tiger Woods | 171 | 22.2% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 17 |
|---|---|
| Median length | 7 |
| Mean length | 9.696891192 |
| Min length | 7 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 1245 | 16.6% | |
| o | 757 | 10.1% | |
| i | 528 | 7.1% | |
| 528 | 7.1% | ||
| U | 415 | 5.5% | |
| k | 415 | 5.5% | |
| w | 415 | 5.5% | |
| 5 | 372 | 5.0% | |
| l | 372 | 5.0% | |
| a | 357 | 4.8% | |
| 8 | 186 | 2.5% | |
| V | 186 | 2.5% | |
| D | 171 | 2.3% | |
| u | 171 | 2.3% | |
| b | 171 | 2.3% | |
| T | 171 | 2.3% | |
| g | 171 | 2.3% | |
| e | 171 | 2.3% | |
| r | 171 | 2.3% | |
| W | 171 | 2.3% | |
| d | 171 | 2.3% | |
| s | 171 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 5286 | 70.6% | |
| Uppercase Letter | 1114 | 14.9% | |
| Decimal Number | 558 | 7.5% | |
| Space Separator | 528 | 7.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| U | 415 | 37.3% | |
| V | 186 | 16.7% | |
| D | 171 | 15.4% | |
| T | 171 | 15.4% | |
| W | 171 | 15.4% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 1245 | 23.6% | |
| o | 757 | 14.3% | |
| i | 528 | 10.0% | |
| k | 415 | 7.9% | |
| w | 415 | 7.9% | |
| l | 372 | 7.0% | |
| a | 357 | 6.8% | |
| u | 171 | 3.2% | |
| b | 171 | 3.2% | |
| g | 171 | 3.2% | |
| e | 171 | 3.2% | |
| r | 171 | 3.2% | |
| d | 171 | 3.2% | |
| s | 171 | 3.2% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 528 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 5 | 372 | 66.7% | |
| 8 | 186 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 6400 | 85.5% | |
| Common | 1086 | 14.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 1245 | 19.5% | |
| o | 757 | 11.8% | |
| i | 528 | 8.2% | |
| U | 415 | 6.5% | |
| k | 415 | 6.5% | |
| w | 415 | 6.5% | |
| l | 372 | 5.8% | |
| a | 357 | 5.6% | |
| V | 186 | 2.9% | |
| D | 171 | 2.7% | |
| u | 171 | 2.7% | |
| b | 171 | 2.7% | |
| T | 171 | 2.7% | |
| g | 171 | 2.7% | |
| e | 171 | 2.7% | |
| r | 171 | 2.7% | |
| W | 171 | 2.7% | |
| d | 171 | 2.7% | |
| s | 171 | 2.7% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 528 | 48.6% | ||
| 5 | 372 | 34.3% | |
| 8 | 186 | 17.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 7486 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 1245 | 16.6% | |
| o | 757 | 10.1% | |
| i | 528 | 7.1% | |
| 528 | 7.1% | ||
| U | 415 | 5.5% | |
| k | 415 | 5.5% | |
| w | 415 | 5.5% | |
| 5 | 372 | 5.0% | |
| l | 372 | 5.0% | |
| a | 357 | 4.8% | |
| 8 | 186 | 2.5% | |
| V | 186 | 2.5% | |
| D | 171 | 2.3% | |
| u | 171 | 2.3% | |
| b | 171 | 2.3% | |
| T | 171 | 2.3% | |
| g | 171 | 2.3% | |
| e | 171 | 2.3% | |
| r | 171 | 2.3% | |
| W | 171 | 2.3% | |
| d | 171 | 2.3% | |
| s | 171 | 2.3% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| CURRENT_STATUS_year_after_min_year | SEPARATED_REFERENCE | INSTANCE_DATE_day_normalized | INSTANCE_DATE_year_after_min_year | tree_3 | PROCEDURE_AREA | CREATION_year_after_min_year | PROCEDURE_NUMBER | MUNC_NUMBER | MUNC_ZIP_CODE | ACTUAL_WORTH | AREA_NAME_EN | STATUS_CODE | PRE_REGISTRATION_NUMBER | PROJECT_NAME_EN | MASTER_PROJECT_EN | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.0 | -0.320149 | 0.032258 | 3 | 4 | -0.101700 | 4 | -0.986695 | 0.631391 | 1.464762 | 1300000.0 | Al Yufrah 2 | VACANT | Rare cases | AQUILEGIA @ AKOYA OXYGEN | Dubai Tiger Woods |
| 1 | 0.0 | -0.320149 | 0.935484 | 3 | 4 | -0.093060 | 4 | -1.176473 | 0.631391 | 1.464762 | 1568000.0 | Al Yufrah 2 | VACANT | Rare cases | AQUILEGIA @ AKOYA OXYGEN | Dubai Tiger Woods |
| 2 | 0.0 | -0.320149 | 0.774194 | 3 | 4 | -0.101700 | 4 | -0.798873 | 0.631391 | 1.464762 | 1300000.0 | Al Yufrah 2 | VACANT | Rare cases | AQUILEGIA @ AKOYA OXYGEN | Dubai Tiger Woods |
| 3 | 0.0 | -0.320149 | 0.193548 | 4 | 4 | -0.093060 | 4 | 0.253223 | 0.631391 | 1.464762 | 1450000.0 | Al Yufrah 2 | VACANT | Rare cases | AQUILEGIA @ AKOYA OXYGEN | Dubai Tiger Woods |
| 4 | 7.0 | -0.244634 | 0.838710 | 1 | 5 | -0.032940 | 3 | 0.271321 | 0.675025 | -0.186560 | 4891888.0 | Hadaeq Sheikh Mohammed Bin Rashid | VACANT | Rare cases | SIDRA | Unknown |
| 5 | 7.0 | -0.244634 | 0.225806 | 1 | 5 | -0.058053 | 3 | -0.128290 | 0.697545 | -0.186560 | 3503888.0 | Hadaeq Sheikh Mohammed Bin Rashid | VACANT | Rare cases | SIDRA | Unknown |
| 6 | 7.0 | -0.244634 | 0.870968 | 1 | 5 | -0.039808 | 3 | 0.287951 | 0.808740 | -0.186560 | 4707888.0 | Hadaeq Sheikh Mohammed Bin Rashid | VACANT | Rare cases | SIDRA | Unknown |
| 7 | 0.0 | -0.320149 | 0.161290 | 3 | 4 | -0.101700 | 4 | 0.029696 | 0.631391 | 1.464762 | 1274000.0 | Al Yufrah 2 | VACANT | Rare cases | AQUILEGIA @ AKOYA OXYGEN | Dubai Tiger Woods |
| 8 | 6.0 | -0.264818 | 0.806452 | 1 | 5 | 0.039590 | 0 | 5.997443 | -0.664002 | -1.919000 | 6047337.0 | Jumeirah First | VACANT | Unknown | Unknown | Unknown |
| 9 | 7.0 | -0.244634 | 0.032258 | 1 | 5 | -0.034953 | 3 | -0.381165 | -0.725933 | -0.186560 | 5043888.0 | Hadaeq Sheikh Mohammed Bin Rashid | VACANT | Rare cases | SIDRA | Unknown |
Last rows
| CURRENT_STATUS_year_after_min_year | SEPARATED_REFERENCE | INSTANCE_DATE_day_normalized | INSTANCE_DATE_year_after_min_year | tree_3 | PROCEDURE_AREA | CREATION_year_after_min_year | PROCEDURE_NUMBER | MUNC_NUMBER | MUNC_ZIP_CODE | ACTUAL_WORTH | AREA_NAME_EN | STATUS_CODE | PRE_REGISTRATION_NUMBER | PROJECT_NAME_EN | MASTER_PROJECT_EN | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 762 | 7.00 | -0.244634 | 0.806452 | 1 | 5 | -0.039701 | 3 | 0.241484 | 0.809678 | -0.186560 | 4097888.0 | Hadaeq Sheikh Mohammed Bin Rashid | VACANT | Rare cases | SIDRA | Unknown |
| 763 | 7.00 | -0.244634 | 0.225806 | 1 | 5 | -0.027870 | 3 | -0.159594 | 0.772613 | -0.186560 | 5182888.0 | Hadaeq Sheikh Mohammed Bin Rashid | VACANT | Rare cases | SIDRA | Unknown |
| 764 | 7.00 | -0.320149 | 0.548387 | 2 | 5 | -0.006627 | 1 | 1.426621 | 0.020173 | 0.010440 | 3901888.0 | Wadi Al Safa 7 | PREMISED | Rare cases | ARABIAN RANCHES - SAMARA COMMUNITY | 558 Villa |
| 765 | 0.00 | -0.320149 | 0.870968 | 3 | 4 | -0.097175 | 4 | -0.892784 | 0.631391 | 1.464762 | 1500000.0 | Al Yufrah 2 | VACANT | Rare cases | AQUILEGIA @ AKOYA OXYGEN | Dubai Tiger Woods |
| 766 | 7.00 | -0.320149 | 0.161290 | 2 | 5 | -0.055822 | 2 | 0.912067 | -0.330418 | 0.010440 | 4113888.0 | Wadi Al Safa 7 | VACANT | Rare cases | ARABIAN RANCHES- AZALEA COMMUNITY | 558 Villa |
| 767 | 0.00 | -0.320149 | 0.935484 | 3 | 4 | -0.093060 | 4 | -1.170115 | 0.631391 | 1.464762 | 1372000.0 | Al Yufrah 2 | VACANT | Rare cases | AQUILEGIA @ AKOYA OXYGEN | Dubai Tiger Woods |
| 768 | 7.75 | -0.320149 | 0.580645 | 0 | 5 | -0.059131 | 1 | 0.922339 | 0.538377 | 0.010440 | 3069799.0 | Wadi Al Safa 7 | PREMISED | Rare cases | ARABIAN RANCHES - SAMARA COMMUNITY | 558 Villa |
| 769 | 6.00 | -0.264818 | 0.967742 | 1 | 5 | 0.018718 | 0 | 0.272788 | 0.028970 | -1.919000 | 5941806.0 | Jumeirah First | VACANT | Unknown | Unknown | Unknown |
| 770 | 0.00 | -0.320149 | 0.806452 | 3 | 4 | -0.097175 | 4 | -0.072041 | 0.631391 | 1.464762 | 1500000.0 | Al Yufrah 2 | VACANT | Rare cases | AQUILEGIA @ AKOYA OXYGEN | Dubai Tiger Woods |
| 771 | 7.00 | -0.244634 | 0.032258 | 1 | 5 | -0.045983 | 3 | -0.387034 | -0.736724 | -0.186560 | 4580888.0 | Hadaeq Sheikh Mohammed Bin Rashid | VACANT | Rare cases | SIDRA | Unknown |